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Abstract 

The present paper considers distributed consensus algorithms that involve TV agents evolv- 
ing on a connected compact homogeneous manifold. The agents track no external reference 
and communicate their relative state according to a communication graph. The consensus 
problem is formulated in terms of the extrema of a cost function. This leads to efficient gradi- 
ent algorithms to synchronize (i.e. maximizing the consensus) or balance (i.e. minimizing the 
consensus) the agents; a convenient adaptation of the gradient algorithms is used when the 
communication graph is directed and time-varying. The cost function is linked to a specific 
centroid definition on manifolds, introduced here as the induced arithmetic mean, that is eas- 
ily computable in closed form and may be of independent interest for a number of manifolds. 
The special orthogonal group SO{n) and the Grassmann manifold Grass(p,n) are treated as 
original examples. A link is also drawn with the many existing results on the circle. 

1 Introduction 

The distributed computation of means/averages of datasets (in an algorithmic setting) and the 
synchronization of a set of agents (in a control setting) — i.e. driving all the agents to a common 
point in state space — are ubiquitous tasks in current engineering problems. Likewise, spreading 
a set of agents in the available state space — linked to the definition of balancing in §4 — is a 
classical problem of growing interest. Practical applications include autonomous swarm/formation 



operation (e.g. [4^; l28!; [22; l23l : l25l | ) , distributed decision making (e.g. [35|; l47|), neural and 
communication networks (e.g. Gflj3), clustering and other reduction methods (e.g. [I3|)> 
optimal covering or coding (e.g. UUlHlj ) ^'^^ other fields where averaging/synchronizing or 
distributing a set of points appear as sub-problems. In a modeling framework, the understanding 
of synchronization or more generally swarm behavior has also led to many important studies (e.g. 

[Uliilii). 



Synchronization algorithms are well understood in Euclidean spaces (e.g. [33|; |32|; |47|; l35l|). 
They are based on the natural definition and distributed computation of the centroid in R"*. 
However, many applications above involve manifolds that are not homeomorphic to an Euclidean 
space. Even for formations moving in or R'^, the agents' orientations evolve in a manifold 
S0{2) = iS*^ or 50(3). Balancing only makes sense on compact state spaces; though many 
theoretical results concern convex or star-shaped subsets of R™ (e.g. [l3|)j most applications 



* Department of Electrical Engineering and Computer Science (Montefiore Institute), University of Liege, Sart- 
Tilman Bldg. B28, B-4000 Liege, Belgium {alain.sarletteaulg.ac.be, r. sepulchreSulg. ac.be). 



1 



SIAM/SICON preprint: subm 09/2006; rev 11/2007; acc 04/2008; publ? > 2009. 



2 



involve compact manifolds. It seems that the study of global synchronization or balancing in 
non-Euclidean manifolds is not widely covered in the literature, except for the circle. 

The present paper proposes algorithms for global synchronization and balancing — grouped 
under the term consensus — on connected compact homogeneous manifolds. A homogeneous 
manifold A4 is isomorphic to the quotient of two Lie groups. Intuitively, it is a manifold on 
which "all points are equivalent" . This makes the problem symmetric with respect to the absolute 
position on the manifold and allows to focus on configurations of the swarm, i.e. relative positions 
of the agents. 

The main idea is to embed M in M™ and measure distances between agents in M™ in order to 
build a convenient cost function for an optimization-based approach. The related centroid on Ai 
may be interesting on its own account; it is therefore studied in more detail in §3. 

Throughout the paper, the abstract concepts are illustrated on the special orthogonal group 
SO{n), the Grassmann manifold Grass{p,n) of p-dimensional vector spaces in M" and sometimes 
the circle S^, which is in fact isomorphic to both 5*0(2) and Grass{l, 2). Other manifolds to which 
the present framework could be applied include the n-dimensional spheres 5" and the connected 
compact Lie gro ups. The circle is the simplest example; it links the present work to existing 
results in [13; llil; [Hi; [iil. SO{n) is important in control applications as the natural state space for 



orientations of n-dimensional rigid bodies. Grass{p, n) rather appears in algorithmic problems; H | 
mentions the optimal placement of N laser beams for cancer treatment and the projection of multi- 
dimensional data on N representative planes as practical applications of optimal distributions on 
Grass{p, n). 

The paper is organized as follows. Previous work is briefly reviewed in §1.1. Section 2 intro- 
duces concepts and notations about graph theory, SO(n) and Grass{p,n). Section 3 is devoted 
to the induced arithmetic mean. A definition of consensus is presented in §4. Section 5 introduces 
a cost function to express the consensus problem in an optimization setting. Section 6 derives 
gradient algorithms based on this cost function, the only communicated information being the 
relative positions of interconnected agents; convergence is proved for any connected, fixed and 
undirected communication graph. Algorithms whose convergence properties can be guaranteed 
under possibly directed, time-varying and disconnected communication graphs are presented in 
§7; they employ an auxiliary variable that evolves in the embedding space M™. 

1.1 Previous work 

Most of the work related to synchronization and balancing on manifolds concerns the circle . 



The most extensive literature on the subject derives from the Kuramoto model (see 4J] for a 
review). Recently however, synchronization on the circle has been considered from a control 
perspective, the state variables representing headings of agents in the plane. Most results cover 
local convergence p2l:l33l|. An interesting set of globally convergent algorithms in SE{2) = ^''^ x 
is presented in [4^, but they require all-to-all communication. Some problems related to global 
discrete-time synchronization on under different communication constraints are discussed in 
[3^ . where connections of the control problem with various existing models are made. Stronger 
results are presented in [s^] for global synchronization and balancing on with varying, directed 
communication links, at the cost of introducing estimator variables that communicating agents 
must exchange. Finally, [i^ presents results on SE(2) similar to those of [42,1 but under relaxed 
communication assumptions, using among others the estimator strategy of jsl; \^ . 

Several authors have already presented algorithms that asymptotically synchronize satellite 
attitudes, involving the rotation group SO (3). They often rely on tracking a common external 
reference (e.g. [27]) or leader (e.g. [sl: l25|). The use of the convenient but non-unique quaternion 
representation for 5*0(3) produces unwanted artefacts in the satellites' motions. Attitude synchro- 
nization without common references and quaternion artefacts is studied in [3_4;]; using the same 
distance measure as the present work, an artificial coupling potential is built to establish local 
stability. All these approaches explicitly incorporate the second-order rigid-body dynamics. In 
accordance with the consensus approach, the present paper reduces the agents to first-order kine- 
matic models to focus on (almost) global convergence for various agent interconnections, without 
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any leader or external reference. Application of this framework in a mechanical setting is discussed 
in a separate paper (ssj . 

Synchronization or balancing on a manifold A4 is closely related to the definition and compu- 
tation of a mean or centroid of points on A^, a basic problem that has attracted somewhat more 
attention, as can be seen from [ISSIIS] among others. 

A key element of the present paper is the computation of a centroid in the embedding space 
R™ of Ai, which is then projected onto Ai. This is connected to the "projected arithmetic mean" 
defined in [3l[ for 5*0(3). In fact, the computation of statistics in a larger and simpler embedding 
manifold (usu ally Euclidean space) and projecting the result back onto the original manifold, goes 
back to 1972 [Tl]. 

A short example in [H addresses the computation of a "centroid of subspaces" , without much 
theoretical analysis; in fact, our algorithms on Grass{p, n) are similar and can eventually be viewed 
as generalizing the developments in in the framework of consensus and synchronization. More 
recently, [17|] uses the centroid associated to the projector representation of Grass{p,n), exactly 
as is done below but without going into theoretical details, to compute the cluster centers in a 
clustering algorithm. The distance measure associated to this centroid on Grass{p, n) is called 
the chordal distance in 11; ^ where it is used to derive optimal distributions ("packings") of N 
agents on some specific Grassmann manifolds. 

Finally, the topic of optimization-based algorithm design on manifolds has considerably devel- 
oped over the last decades (see e.g. 0, [3] and the books [isl: Bp. 



2 Preliminaries 

2.1 Elements of graph theory 

Consensus among a group of agents depends on the available communication links. When consid- 
ering limited agent interconnections, it is customary to represent communication links by means 
of a graph. The graph G is composed of N vertices (the N agents) and contains the edge [j, k) if 
agent j sends information to agent k, which is denoted j k. A positive weight ajk is associated 
to each edge (j, k) to obtain a weighted graph; the weight is extended to any pair of vertices by 
imposing ajk = iff {j, k) does not belong to the edges of G. The full notation for the resulting 
digraph (directed graph) is G{V,E,A) where V = {vertices}, E — {edges} and A, containing the 
ajTc, is the adjacency matrix. The convention akk = Vfc is assumed for the representation of 
communication links. 

The out-degree of a vertex k is the quantity — X^jLi '^kj of information leaving k towards 
other agents; its in-degree is the quantity d^*-* — J2jLi ^jk of information received by k from other 
agents. These degrees can be assembled in diagonal matrices D'^"-' and D^^^\ A graph is balanced 
if d(°) = D(0. This is satisfied in particular by undirected graphs, for which A = A^ . A graph is 
bidirectional if (j, k) £ E 'i^ S E (but not necessarily A = A'^). 

The Laplacian L of a graph is L — D ~ A. For directed graphs, I?*^*-' or can be used, leading 
to the in-Laplacian L^*) = D^*^ — A and the out-Laplacian L^^"^ — D'^"^ — A. By construction, L^*) 
has zero column sums and L(°) has zero row sums. The spectrum of the Laplacian refiects several 
interesting properties of the associated graph, specially in the case of undirected graphs (see for 
example [10]). 

G'(y, E, A) is strongly connected if it contains a directed path from any vertex j to any vertex 
I (i.e. a sequence of vertices starting with j and ending with / such that (vk^Vk+i) G E for any 
two consecutive vertices w^, Wfc+i); G is weakly connected if there is such a path in the associated 
undirected graph, with adjacency matrix A ~\- A^ . 

For time-varying interconnections, a time-varying graph G{t) is used and all the previously 
defined elements simply depend on time. If the elements of A{t) are bounded and satisfy some 
threshold ajk{t) > S > V(j, k) e E{t) and \/t, then G{t) is called a 5-digraph. The present paper 
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always considers J-digraphs. 

In a (5-digraph G{V, E, A), vertex j is said to be connected to vertex k across [ii, if there is 
a path from j to k in the digraph G{V, E, A) defined by 



G can be seen as a time-integrated graph while the ^-criterion prevents vanishing edges. A S- 
digraph G(t) is called uniformly connected if there exist a vertex k and a time horizon T > such 
that yt, k is connected to all other vertices across [t, t + T]. 

2.2 Specific manifolds 

The concepts presented in this paper are illustrated on two particular manifolds: SO{n) and 
Grass{p, n). 

The special orthogonal Lie group SO{n) It can be viewed as the set of positively oriented 
orthonormal bases of R", or equivalently as the set of rotation matrices in M"; it is the natural 
state space for the orientation of a rigid body in R". In its canonical representation, used in 
the present paper, a point of SO{n) is characterized by a real n x n orthogonal matrix Q with 
determinant equal to +1. SO{n) is homogeneous (as any Lie group), compact and connected. It 
has dimension n(n — l)/2. 

The Grassmann manifold Grass{p, n) Each point on Grass{p, n) denotes a p-dimensional 
subspace 3^ of M". The dimension oi Grass{p,n) is pln — p). Since Grass{n—p,n) is isomorphic to 
Grass{p, n) by identifying orthogonally complementary subspaces, this paper assumes without loss 
of generality that p < ^. For the special case p = 1, the Grassmann manifold Grass{l, n) is also 
known as the projective space in dimension n. Grass{p, n) is connected, compact and homogeneous 
as the quotient of the orthogonal Lie group 0{n) by 0{p) x 0{n — p). Indeed, y e Grass{p, n) can 
be represented for instance by a (not necessarily positively oriented) orthonormal basis Q G 0{n) 
whose first p column-vectors span 3^; the same point y g Grass{p, n) is represented by any Q 
whose first p column- vectors span y (O(p)-symmetry) and whose last n — p column- vectors span 
the orthogonal complement of y {0{n — p)-symmetry). Other quotient structures for Grass{p, n) 
are discussed in [l*! . 

A matrix manifold representation of Grass{p,n) found in [1] assigns to y any n x p matrix 
Y oi p orthonormal column- vectors spanning y (p-basis representation) ; all Y corresponding to 
rotations and reflections of the p column- vectors in y represent the same y (O(p)-symmetry), 
so this representation is not unique. The dimension of this representation is np — p{p + l)/2. 
In poj . a point of Grass{p,n) is represented by 11 = YY'^ ^ the orthonormal projector on y 
(projector representation); using the orthonormal projector on the space orthogonal to 3^, nj_ = 
In — Y where /„ denotes the nx n identity matrix, is strictly equivalent. The main advantage 
of this representation is that there exists a bijection between Grass{p, n) and the orthonormal 
projectors of rank p, such that the projector representation makes Grass{p, n) an embedded 
submanifold of the cone of n x n symmetric positive semi-definite matrices. A disadvantage of 
this representation is its large dimension ri(n + l)/2. 




3 The induced arithmetic mean 



A homogeneous manifold is a manifold with a transitive group action by a Lie group Q: it is 
isomorphic to the quotient manifold G/Ti. of a group Q by one of its subgroups Ti.. Informally, 
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it can be seen as a manifold on which "aU points are equivalent" . The present paper considers 
connected compact homogeneous manifolds satisfying the following embedding property. 

Assumption 1 A4 is a connected compact homogeneous manifold smoothly embedded in R™ with 
the Euclidean norm \\y\\ — tm constant over y Cz A4. The Lie group Q acts as a subgroup of the 
orthogonal group on M™ . 

It is a well-known fact of differential geometry that any smooth ^-dimensional Riemannian 
manifold can be smoothly embedded in R™. The additional condition |jy|| = r^vt is in agreement 
with the fact that all points on Ai should be equivalent. It is sometimes preferred to represent 
?/ e by a matrix B E K"i ><"2 instead of a vector. Componentwise identification xti2 ^ j^m 
assumed whenever necessary; the corresponding norm is the Frobenius norm \\B\\ = ■\/trace(i?^i?). 

Consider a set of N agents on a manifold M satisfying Assumption [T] The position of agent 
k is denoted by yk and its weight by Wk- 

Definition 1 The induced arithmetic mean lAM <Z M. of N agents of weights Wk > and 
positions yk E M, k ^ 1...N , is the set of points in M. that globally minimize the weighted sum of 
squared Euclidean distances in R™ to each yk'. 

IMI = argminX;^^! Wfc dR™(2/fe,c) = argmin^^f^i «^fc (j/fc - c)^(j/fc - c) . (1) 

c£M ceM 

The anti- [induced arithmetic mean] AIAM C /A is the set of points in Ai that globally maxi- 
mize the weighted sum of squared Euclidean distances in R™ to each yk'. 

AIAM = argmaxX^f^i iffc c^E™(2/fc,c) = argmaxX^f^i Wfc (j/fc " c)^(?/fe - c) . (2) 
ceM ceM 

The terminology is derived from (sij where the lAM on 5*0(3) is called the projected arithmetic 
mean. The point in Definition [T] is that distances are measured in the embedding space R™. 
It thereby differs from the canonical definition of mean of N agents on the Karcher mean 
which uses the geodesic distance dM along the Riemannian manifold M. (with, in 



the present setting, the Riemannian metric induced by the embedding of Ad in R™): 

C Karcher = argmiu X^fcLl Wk d\^{yk, c) . 

The induced arithmetic mean has the following properties. 

1. The lAAI of a single point yi is the point itself. 

2. The lAM is invariant under permutations of agents of equal weights. 

3. The lAM commutes with the symmetry group of the homogeneous manifold. 

4. The lAM does not always reduce to a single point. 

The last feature seems unavoidable for any mean (including the Karcher mean) that satisfies the 
other properties. The main advantage of the lAM over the Karcher mean is computational. The 
lAAI and AIAAI are closely related to the centroid. 

Definition 2 The centroid Ce G R™ of N weighted agents located on Ai is 
Ce = W SfeLi «^fc Vk , where W = Y.k=i ■ 
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Since ||c|| = tm for c e by Assumption [U equivalent definitions for the lAM and MAM are 



Hence, computing the lAM and AIAAI just involves a search for the global maximizers of a linear 
function on R™ in a very regular search space A4 . Local maximization methods would even suffice 
if the linear function had no maxima on M other than the global maxima. This is the case for 
any linear function on SO{n) and Grass{p,n) (see §3.1) as well as the n-dimensional sphere S" 
in Not knowing whether this property holds for all manifolds satisfying Assumption [l] we 

formulate the following blanket assumption. 

Assumption 2 The local maxima of any linear function f{c) — c^ b over c € Al, with b fixed in 
R™, are all global maxima. 

3.1 Examples 

These examples exclusively consider the lAM; from ([3]), the conclusions for the MAM are simply 
obtained by replacing Ce with — Cg. 

The circle The circle embedded in with its center at the origin satisfies Assumptions [T] and 
[2j The lAM is simply the central projection of Ce onto the circle. Hence it corresponds to the 
whole circle if Ce = and reduces to a single point in other situations. The lAM uses the chordal 
distance between points, while the Karcher mean would use arclength distance. 

The special orthogonal group The embedding of SO{n) as orthogonal matrices Q G R"^", 
det((5) > 0, satisfies Assumption [1] since \\Q\\ = ^trace((5^Q) ~ \pn. It also satisfies Assumption 
[5] (proof in §6). Ce = is a general nx n matrix. The lAM is linked to the polar decomposition 

of Ce. Any matrix B can be decomposed into \JR with U orthogonal and i? symmetric positive 
semi-definite; K is always unique, U is unique if B is non-singular [7]. Each f/ is a global minimizer 
of d^^y.^{c, B) over c g 0(n). Thus, if det(Ce) > 0, the IMI contains aU matrices U: det{U) > 
obtained from the polar decomposition of Cg; this was already noticed in ^31]. When det(Ce) < 0, 
the result is more complicated but still has a closed-form solution. 

Proposition 1 Consider U an orthogonal matrix obtained from the polar decomposition Ce = 
UR. The lAAI of N points on SO{n) is characterized as follows. 

//det(Ce) > 0, then lAM = {U : det(U) > 0}. It reduces to a single point if the multiplicity 
of as an eigenvalue of Ce is less or equal to 1. 

//det(Ce) < 0, then lAM = {UHJH^} where det{U) < 0, H contains the orthonormalized 
eigenvectors of R with an eigenvector corresponding to the smallest eigenvalue of R in the 



lAM — argmax(c"^ Ce) and MAM — argmax(— c"^ Ce) . 



(3) 





n-l 




reduces to a single point if the smallest 



eigenvalue of R has multiplicity 1. 



Proof: It is provided in §6 after introducing further necessary material to compute the critical 
points of c^Ce, among which the local maxima are selected. □ 
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The Grassmann manifold The representation of Grass{p, n) with p-bases Yk is not an em- 
bedding and cannot be used in the proposed framework, because the p-dimensional subspace of 
R" spanned by the columns of Ce = J2k would depend on the particular matrices Yfc chosen 
to represent the subspaces yk- The lAAI is defined with the projector representation, embedding 
Grass{p, n) in §+. The latter satisfies Assumption [TJ the Frobenius norm of a p-rank projector is 
y/p. It also satisfies Assumption [2] (proof in §6). The centroid Ce of N projectors is generally a 
symmetric positive semi-definite matrix of rank > p. 

Proposition 2 The lAhl contains all dominant p-eigenspaces oj Ce- It reduces to a single point 
if the p-largest and {p + l)-largest eigenvalues of Cg are different. 

Proof: Following the same lines as for SO{n), it is postponed to §6. □ 

In fact, for y e Crass{p, n) with a p-basis Y and the projector Hy — YY'^ , the cost function 
in ([3]) becomes 

fiUy) = trace(n3;Ce) = trace(r'^CeF) = trace((r'^y)"i Y^CeY) (4) 

where the last expression is equal to the generalized Rayleigh quotient for the computation of the 
dominant p-eigenspace of Cg. The computation of eigenspaces from cost function Q is extensively 
covered in [H; Q ■ Furthermore, it is a well-known fact of linear algebra that the p largest eigenvalues 
(the others being 0) of nj;nfc are the squared cosines of the principal angles 0^, i = l...p, between 
subspaces y and yk- This provides a geometrical meaning for the lAM of subspaces: it minimizes 
the sum of squared sines of principal angles between the set of subspaces yt, k = 1...N, and a 
centroid candidate subspace y, i.e. lAM = argmin-y X^fc^Sf^i sin^(^f'fe)- The Karcher mean 
admits the same formula with sin^(0^) replaced by (i/*);)^ 

4 Consensus 

Consider a set of agents with positions yk, k — 1...N, on a manifold A4 satisfying Assumption 
[TJ The rest of this paper assumes equal weights Wk — I Vfc; extension to weighted agents is 
straightforward. Suppose that the agents are interconnected according to a fixed digraph C of 
adjacency matrix A — [ajk]- 

Definition 3 Synchronization is the configuration where y.j = yk yj,k. 

Definition 4 A consensus configuration with graph G is a configuration where each agent k 
is located at a point of the lAhl of its neighbors j ~^ k, weighted according to the strength of the 
corresponding edge. Similarly, an anti- consensus configuration satisfies this definition with lAM 
replaced by MAM . 

Consensus: yk G argmax ic^ Y^^=i %fe Vj ) ^k . (5) 

Anti- consensus: yk G argmin (c^ X^^Li Qjfe Vj) Vfc . (6) 

ceM \ ^ I 

Note that consensus is defined as a Nash equilibrium: each agent minimizes its cost function 
assuming the others fixed; the possibility to decrease cost functions by moving several agents 
simultaneously is not considered. Consensus is graph-dependent: agent fc reaches consensus when 
it minimizes its distance to agents j ^ fc. 

Proposition 3 If G is an equally-weighted complete graph, then the only possible consensus con- 
figuration is synchronization. 
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Proof: At consensus the yk satisfy vl ^j^kVi — ^j^kVi Vfc and Vc e A^. Further- 
more, it is obvious that y^Hk > VkC for any c € \ {uk}- As a consequence, ?/J T^^=i Vj ^ 
X^jLi Vj Vc e \ {uk} and Vfc. Thus according to Q, each is located at the IMI of all 
the agents, which moreover reduces to a single point; thus yk = yj = IAM{{yi : I = 1...7V}) Vfc, j. 
□ 

Synchronization is a configuration of complete consensus. To similarly characterize a configu- 
ration of complete anti-consensus, it appears meaningful to require that the lAM of the agents is 
the entire manifold A4; this is called a balanced configuration. 

Definition 5 N agents are balanced if their lAM contains all A4 . 

Balancing implies some spreading of the agents on the manifold. A full characterization of bal- 
anced configurations seems complicated. Balanced configurations do not always exist (typically, 
when the number of agents is too small) and are mostly not unique (they can appear in qualita- 
tively different forms). The following link exists between anti-consensus for the equally- weighted 
complete graph and balancing. 

Proposition 4 All balanced configurations are anti- consensus configurations for the equally-weighted 
complete graph. 

Proof: For the equally-weighted complete graph, ([6]) can be written 

Vk e argmin (c^ (N Ce - yk)) Vfc . (7) 
ceM 

Assume that the agents are balanced. This means that /(c) — <F Ce must be constant over c G Al. 
Therefore ([7]) reduces to yk — yk Vfc which is trivially satisfied. □ 

In contrast to Proposition [31 Proposition 2] does not establish a necessary and sufficient condi- 
tion; and indeed, anti-consensus configurations for the equally-weighted complete graph that are 
not balanced do exist, though they seem exceptional. 

4.1 Examples 

The following examples illustrate among others the last assertions about balanced configurations. 

Tiie circle Anti-consensus configurations for the equally-weighted complete 
graph are fully characterized in [i^ . It is shown that the only anti-consensus configurations 
that are not balanced correspond to {N + l)/2 agents at one position and (A'' — I)/2 agents at 
the opposite position on the circle, for N odd. Balanced configurations are unique for N = 2 and 
A^ = 3 and form a continuum for A^ > 3. 

Another interesting illustration is the equally-weighted undirected ring graph in which each 
agent is connected to two neighbors such that the graph forms a single closed undirected path. 
Regular consensus configurations correspond to situations with consecutive agents in the path 
always separated by the same angle < x < 7r/2; regular anti-consensus configurations have 
7r/2 < X ^ III addition, for A^ > 4, irregular consensus and anti-consensus configurations 
exist where non-consecutive angles of the regular configurations are replaced by (tt — x). As a 
consequence: 

1. Several qualitatively different (anti-)consensus configurations exist. 
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2. Consensus and anti-consensus configurations can be equivalent when discarding the graph. 
For example, the positions occupied by 7 agents separated by 2tt/7 (consensus) or An/ 7 
(anti-consensus) are strictly equivalent; the only difference, based on which agent is located 
at which position, concerns the way the links are drawn. 

3. Degenerate configurations of simultaneous consensus and anti-consensus exist (e.g. x = 7r/2 
for = 4,8, ...); this singularity is specific to the particular graph. 

4. There is no common anti-consensus state for all ring graphs. Indeed, considering an agent 
fc, a common anti-consensus state would require that any two other agents, as potential 
neighbors of k, are either separated by tt or located at both sides of A; at a distance x > 7r/2; 
one easily verifies that this cannot be satisfied for all k. 



The special orthogonal group Simulations of the algorithms proposed in this paper suggest 
that balanced configurations always exist for iV > 2 if n is even and for > 4 if n is odd. Under 
these conditions, convergence to an anti-consensus state that is not balanced is not observed for 
the equally-weighted complete graph. 

The Grassmann manifold Balanced states on Grass{p, n) appear if all eigenvalues of Ce are 
equal. Since trace(Ce) = J^k trace(nfc) = p, this requires Ce = ^/n. This is not always possible 
with N orthonormal projectors of rank p. As for SO{n), simulations tend to indicate that it is 
possible when N is large enough; however, computing the minimal value of N for a given n and p 
is not straightforward. 

5 Consensus optimization strategy 

The presence of a maximization condition in the definitions of the previous sections naturally 
points to the use of optimization methods. The present section introduces a cost function whose 
optimization leads to (anti-)consensus configurations. For a graph G with adjacency matrix A = 
[ajk] and associated Laplacian L*^*) = [Z^^] and the variable y = (yi, ypf) g define 

PLiy) = 2^Ef=i Ej^i o.jk yfvk = - jhT,k=i T.'jLi o-jk hj - Vkf (8) 

2 

with constant = £^ aj^. The index L refers to the fact that ([5]) can also be written as 

a quadratic form on the graph Laplacian: 

Pdy) = 6 - 2F^Ef=i Ef=i ^1^ yjvk with constant 6 = ^Efe 4^^ • (9) 

In [s^l and [i^l, this form of Pl is studied on the circle for undirected equally- weighted graphs. 
For the unit-weighted complete graph, P :— Pi^ + ^ equals 

P{y) = k\\Cef, (10) 

proportional to the squared norm of the centroid Ce ■ This is a classical measure of the synchrony 
of phase variables on the circle S^, used for decades in the literature on coupled oscillators; in the 
context of the Kuramoto model, P{y) is known as the "complex order parameter" (because 
is usually identified with C in that context). In [S^], P is used to derive gradient algorithms for 
synchronization (by maximizing (jlOp ) or balancing (by minimizing (jlOp ) on . 

Proposition 5 Synchronization of the N agents on Ai is the unique global maximum of P^ 
whenever the graph G associated to L*^*' is weakly connected. 
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Proof: According to the second form of ([5]), Pl reaches its global maximum when yj — yk for all 
j, k for which Ujk 7^ 0- If G is weakly connected, this equality propagates through the whole graph 
such that 2/1=2/2 = •■■ = UN- D 



Proposition 6 Consider N agents on a manifold M satisfying A s sumptions ll\ and\^ Given an 
undirected graph G, a local maximum of the associated cost function Phiu) necessarily corresponds 
to a consensus configuration and a local minimum of Pl {y) necessarily corresponds to an anti- 
consensus configuration. 



Proof: The proof is given for local maxima; it is strictly equivalent for local minima. For y* — 
(2/^...2/7v) to be a local maximizer of Pl, 2/fc must be, for each k, a local maximizer of Pk{c) ■= 
PL{yl--yl-i^ yl^^...ylj). Since A = , pk takes the linear formpfe(c) = 6 + 1^ (Ej ^jfe Vj) 
with ^fe constant V/c. Thanks to Assumption [2l all local maxima of Pk{c) are global maxima. 
Therefore, 2/^ is a global maximum of pj,(c) for all fc, which corresponds to Definition|3]of consensus. 
□ 



Proposition [H] establishes that a .sufficient condition for (anti-)consensus configurations is to 
optimize Pl. However, nothing guarantees that this is also necessary. In general, optimizing Pl 
will thus provide proven (anti-)consensus configurations, but not necessarily all of them (this is 
because consensus maximizes Pl on for only moving one agent with others fixed, and not 
along directions of combined motion of several agents). The remaining sections of this paper 
present algorithms that drive the swarm to (anti-)consensus. Being based on the optimization 
of Pl, these algorithms do not necessarily target all possible (anti-)consensus configurations. For 
instance, for a tree, maximization of Pl always leads to synchronization, although other consensus 
configurations can exist. 

5.1 Examples 

On SO{n) and Grass{p,n), Pl with matrix forms for the elements yk becomes 

PUv) ^ ^E^ELiajktr'dceiyJyk) with 2/fc G M"^" Vfc . (11) 



The special orthogonal group Each term QjQk = Qj^Qk is itself an element of SO{n). It 
is actually the unique element of SO{n) translating Qj to Qk by matrix (group) multiplication on 
the right. Hence, on the Lie group SO{n), the order parameter Pl measures the sum of the traces 
of the elements translating connected agents to each other. Observing that the trace is maximal 
for the identity matrix and considering the particular case of 50(2), one can easily imagine how 
the trace of Qj'^Qfccharacterizes the distance between Qj and Qk. This cost function has been 
previously used in [3 [s^l as a measure of disagreement on 5*0(3). 

The Grassmann manifold On Grass{p,n), PT|) can be rewritten as 

PL{y) = ^ Ef=i uLi o^j^ (ELi cos2(0;.,)) 

withqi*^ = i'^ principal angle between yj and yk . This reformulation has previously appeared in 
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6 Gradient consensus algorithms 

The previous sections pave the way for ascent and descent algorithms on P and P^. This paper 
considers continuous-time gradient algorithms, but any descent or ascent algorithm — in partic- 
ular, discrete-time — will achieve the same task; see ^ for extensive information on this subject. 
In the present paper, the gradient is always defined with the canonical metric induced by the 
embedding of M in M™. 

6.1 Fixed undirected graphs 

A gradient algorithm for Pj^ leads to 

yk{t) = 2N^agTad^^j^{PL) , k ^ 1...N , (12) 

where a > (resp. a < 0) for consensus (resp. anti-consensus), i/k denotes the time-derivative of 
agent fc's position and grad^. (/) denotes the gradient of / with respect to yk along M. This 
gradient can be obtained from the gradient in M™, 

gradfc_R™(PL) = 2wJ2jio.jk + akj) Vj , 
by orthogonal projection PTO^rpj^ ^ onto the tangent space to M at yk, yielding V/c 

yk{t) = a Proji^^^fc (Ej(ajfe + 0'kj)yj) = a Projj.^ ^ {^j{ajk + akj){yj - yk)) ■ (13) 

The last equality comes from Proj^^^ j,(yfe) = 0. It shows that to implement this consensus 
algorithm, each agent k must know the relative position with respect to itself of all agents j 
such that j k OT k j. Since the information flow is restricted to j k, (jl3[) can only be 
implemented for undirected graphs, for which it becomes 

yk{t) ^ 2a FroiT_^ J, (^Y.f=i o-jkiVj - Vkij , k = l...N. (14) 
In the special case of a complete unit-weighted graph, 

ykit)^2aNProiTM,kiC'e{t)-yk) , k ^ I...N . (15) 

Proposition 7 A group of N agents moving according to on a manifold M satisfying As- 
sumptions\^ and\^ where the graph G associated to A — [ojk] is undirected, always converges to a 
set of equilibrium points. If a < 0, all asymptotically stable equilibria are anti- consensus configu- 
rations for G. If a > 0, all asymptotically stable equilibria are consensus configurations for G (in 
particular, for the equally-weighted complete graph, the only asymptotically stable configuration is 
synchronization). 

Proof: A4 being compact and the Ojk bounded, Pl is upper- and lower-bounded. Pl is always 
increasing (decreasing) for a > (a < 0) along solutions of ([14]), since 

Pl = Efe2)fgradfe,^(PL) = 2N'aj:jgrad^^^{PL)f . 

By LaSalle's invariance principle, the swarm converges towards a set where P^ = 0, implying 
gradj, _;^(Pl) = <^ y/t = Vfc and the swarm converges to a set of equilibria. For a > 
(a < 0), since Pl always increases (decreases) along solutions, only local maxima (minima) can 
be asymptotically stable. Proposition [5] states that all local maxima (minima) of P^ correspond 
to consensus (anti-consensus). □ 

Remark 1 Computing gradj, ^ directly along the manifold, as in f^], can be much more efficient 
if the dimension of M is substantially lower than m (see %6.3). 
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6.2 Extension to directed and time- varying graphs 

Formally, algorithm (jl4p can be written for directed and even time- varying graphs, although the 
gradient property is lost for directed graphs and has no meaning in the time- varying case (since Pl 
then explicitly depends on time). Nevertheless, the general case of (fT4|) with varying and directed 
graphs still exhibits synchronization properties. 

It can be shown that synchronization is still a stable equilibrium; it is asymptotically stable 
if disconnected graph sequences are excluded. Its basin of attraction includes the configurations 
where all the agents are located in a convex set of A4. Indeed, convergence results on Euclidean 
spaces can be adapted to manifolds when agents are located in a convex set (see e.g. [s^l)- On 
the other hand, examples where algorithm (|14p with a > runs into a limit cycle can be built for 
as simple cases as undirected equally- weighted (but varying) graphs on the circle (see §6.3). 

Simulations on SO{n) and Grass{p, n) seem to indicate that for randomly generated digraph 
sequence^, the swarm eventually converges to synchronization when a > 0; this would correspond 
to generic convergence for unconstrained graphs. 

Algorithm (fT4|) can lead to a generalization of Vicsek's phase update law (see [i^) to manifolds. 
The Vicsek model is a discrete-time algorithm governing the headings of particles in the plane, 
and hence operates on the circle. It can be written as 

Vkit -f 1) e IMI {{y,m --km Git)} U {ykit)}) , k = 1...N , (16) 

with the definitions introduced in the present paper; interconnections among particles depend on 
their relative positions in the plane (so-called "proximity graphs"). Vicsek's law can be directly 
generalized in the form (fT6|) to any manifold satisfying Assumption [1] Based on the previous 
discussions, it is clear why can be viewed as a discrete-time variant of When run 

asynchronously on a fixed undirected graph, (|16p is an ascent algorithm for P^; see (37] for a 
precise relationship between the continuous-time and discrete-time consensus algorithms on the 
circle. 



6.3 Examples 



Consensus on the circle is studied in [42; l37|; |39|; |43|; the other algorithms presented here are 
original. 



The circle Denoting angular positions by dk, the specific form of (|14p for is 

9k = a' Ef=i <^jk sin{9k - 9,) , k = I...N . (17) 

For the equally-weighted complete graph, this is strictly equivalent to the Kuramoto model [2^ 
with identical (zero) natural frequencies. 

Algorithm (|17p can run into a limit cycle for varying graphs. Consider a regular consensus 
state for an equally-weighted ring graph Gi, with consecutive agents separated by x < ""/S (local 
maximum of Pli)- Define G2 by connecting each agent to the agents located at an angle ^ > tt/2 
from itself with ij: properly fixed. G2 is a collection of disconnected ring graphs and the swarm is 
at a local minimum of . Starting the system in the neighborhood of that state and regularly 
switching between Gi and G^, the system will oscillate in its neighborhood, being driven away by 
G2 and brought back by Gi if consensus is intended and reversely if anti-consensus is intended. 



^More precisely, the following distibution was examined: initially, each element ajfc independently takes a value 
in {0, 1} according to a probability Prob(l) = p. The corresponding graph remains for a time tg^aph uniformly 
distributed in [tmimtmax], after which a new graph is built as initially. 
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The special orthogonal group The tangent space to SO{n) at the identity /„ is the space 
of skew-symetric n x n matrices. By group niuhiplication, the projection of _B G R"^" onto 

the tangent space to SO{n) at Qk is QkSkew{Q^^B) = Qk - -^^). This leads to the 

following explicit form of algorithm (jl4p on SO{n), where the right-hand side only depends on 
relative positions of the agents with respect to k: 

Qk'Qk = « a,fc {Q^'Qj - Q-'Qk) , k = l...iV . (18) 

Using Lemma [T] in the appendix, the following proves that SO{n) satisfies Assmnption [2] It also 
includes the proof of Proposition [TJ 

Proposition 8 The manifold SO(n) satisfies Assumptions^ 

Proof: (+ PropH]) Consider a linear function /(Q) = trace(O^S) with Q e SO{n) and B G R""""; 
gradjj„XTi(/) = B so gradso{n){f) — %{Q^ B — B^Q). Since Q is invertible, critical points of / 
satisfy {Q^B — B^Q) = 0, meaning that they take the form described by Lemma [TJ Using 
notations of Lemma [l] write R = HKH^ where A contains the (non- negative) eigenvalues of R. 
This leads to 

Q - UHJH^ Q^B = HJAH^ ^ f{Q) - ~E'j=i + T.%i+i Ajj ■ 

If Z > 2, select any m G [2,Z] and define = UHJAH^ where A is the identity matrix except 
that A{1, 1) = A{m, m) = cos(e) and A{1, m) ~ —A{ra^ 1) = sin(e) with e arbitrarily small. It is 
straightforward to see that f{Qs) > f{Q) unless An = Amm = 0. Similarly, if / = 1 and 3 m>2 
such that A,nm < All, then f{Qs) > f{Q) with and A defined as previously. Therefore, 

1. if det(i3) > 0, local maxima require I — such that Q = U and f{Q) is the sum of the 
eigenvalues of R; 

2. if det(i?) < 0, local maxima require U to take the form of Lemma [T] with I ~ 1 and 
All ^ AmmVm; thus the first column of H corresponds to a smallest eigen- value of R and 
f{Q) is the sum of n — 1 largest eigenvalues minus the smallest one. 

This shows that all maxima of f{Q) are global maxima (since they all take the same value) 
and, with B = Ce, characterizes the lAM. □ 



The Grassmann manifold The projection of a matrix M G S+ onto the tangent space to 
Grass(j), n) at 11^ is given in [2§| as IlfcA/IIxfc + IIxfe-MIIfc. This leads to 

rifc = 2a ajk {UkUjU^k + U^kHjUk) , k = 1...N . (19) 

In practice, the basis representation is handier than 11^ since it involves smaller matrices. 
Computing the gradient of PLii^k, k = l...iV}) = PL{{YkY^ , k = 1...N}) directly on the 
quotient manifold as explained in |l| leads to the algorithm 

Yk = 4a {Y, M^.u - Ffc MJ.^Mj.u) , k = I...N , (20) 

where thepxp matrices Alj.^ are defined as Afj.fc = Yj^Y}^. For theoretical purposes, the projector 
representation is an easier choice, as for the following proofs. 



Proposition 9 The Grassmann manifold satisfies Assumptions^ 
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Proof: (+ Propl21) Consider a linear function /(H) = trace(n^i?) where i? G §+ and 11 represents 
y G Grass{p, n); grad^„x»i(/) — B so grad(3r„^s(p „) (/) = IIBII^ +nj__Bn. The ranges of the first 
and second terms in gradg^^^^^p «)(/) a-^e at most y and its orthogonal complement respectively, 
so they both equal zero at a critical point y* , such that 3^* is an invariant subspace of B. In 
an appropriate basis (ei...e„), write 11* = diag(l, ...1,0, ...0) and B = diag(/xi, .../ip, /ip+i.../i„). If 
3 d < p and / > p such that pd < then any variation of 11* rotating towards ei strictly 
increases /(H). Therefore, at local maxima of /(II), the p-dimensional space corresponding to 
n must be an eigenspace of B corresponding to p largest eigenvalues. This implies that at any 
local maximum, /(II) equals the sum of p largest eigenvalues of B, so Assumption [2] is satisfied. 
Replacing B by Ce proves Proposition [2l □ 



7 Consensus algorithms with estimator variables 

Section 6 derives algorithms that lead to a consensus situation linked to the interconnection graph. 
But in many applications, the interconnection graph is just a restriction on communication possi- 
bilities, under which one actually wants to achieve a consensus for the complete graph. Moreover, 
allowing directed and time- varying communication graphs is desirable for robustness. This section 
presents algorithms achieving the same performance as those of §6 for the equally-weighted com- 
plete graph — that is, driving the swarm to synchronization or to a subset of the anti-consensus 
configurations for the equally-weighted complete graph which seems to contain little more than 
balancing — under very weak conditions on the actual communication graph. However, this re- 
duction of information channels must be compensated by adding a consensus variable Xk G K™, 
which interconnected agents are able to communicate, to the state space of each agent. 

7.1 Synchronization algorithm 

For synchronization purposes, the agents run a consensus algorithm on their estimator variables Xk 
in K™, k = 1...N, initialized arbitrarily but independently and such that they can take any value 
in an open subset of M™; Vfc, agent fc's position yk on A4 independently tracks (the projection on 
M of) Xk- This leads to 

ifc = /3 J2j Ojfe {xj ~ Xk) , P > (21) 

Vk = 7s gradj._^(y^Xfc) = 7s Projy_;K_fc(a;fc) , 7s > , k^l...N. (22) 

Equation pTjl is a classical consensus algorithm in M™ , where Xk (t) points from Xk (t) towards the 
centroid of the (appropriately weighted) Xj (t) for which j k at time t. According to [33; 32; , 
if the time-varying communication graph G{t) is piecewise continuous in time and uniformly 
connected, then all the Xk exponentially converge to a common consensus value Xoc', moreover, if 
G{t) is balanced for all t, then Xoo = (i-^- ^oo is the centroid of the initial Xk)- This 

implies the following convergence property for ((2T|) . (|22p . where the notation lAMg generalizes the 
definition ([3]) of the lAM when the points defining Ce are not on M. 

Proposition 10 Consider a piecewise eontinuous and uniformly connected graph G{t) and a man- 
ifold M. satisfying Assumptions [7] and [H The only stable limit configuration of the yk under 
k21]) . ^2^) . with the Xk initialized arbitrarily but independently and such that they can take any 
value in an open subset of , is synchronization at yoo — Proj-p^ ^(a;oo); if G{t) is balanced, 
y^^IMdg{xk{Q),k = \...N}. 

Proof: Convergence of ((2T|l towards Xk = x^ n VA: is proved in [s^l; the property x^o — J2k 

for balanced graphs is easy to check (see [35j). As a consequence, the asymptotic form of (PT|) . (|22p 
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is a set of N independent systems 

Xk ^CXD (^^) 

Vk = 7sProj-r^_fc(xoo) , k = 1...N , (24) 

where Xoo is a constant. According to ^30], the w-hmit sets of the original system (PT|) . (|22p 
correspond to the chain recurrent sets of the asymptotic system ((23)) . (p4|) . The first equation is 



trivial. According to Proposition 4 in 2l| and Sard's theorem, since (|24|) is a gradient ascent 
algorithm for f{yk) = Uk Xoo and f{yk) is smooth (as the restriction of a smooth function to the 
smooth embedded manifold Ai), the chain recurrent set of ([24]) is equal to its critical points. Since 
Xoo is a linear combination of the Xk{0), variations of the Xk(0) are equivalent to variations of Xoo- 
Property o: Any open neighborhood O of any point Xo G K™ contains a point Xa for which 
fiyk) lias a unique (local = global, by Assumption [2]) maximizer. 

Proof: If y]^ Xo has multiple maximizers, select one of them, call it y^. Then for > 0, y^a^o + 
yly* < yjxo + <y yly* < y'^Xo + cr with equahty holding if and only if yk = y*, so is the 
unique maximizer of (xo + cry* ) . Since any open neighborhood O of Xo contains points of the 
form Xa = Xo + cry*, cr > 0, property o is proved. 

Because of Property o, with respect to variations of the Xfe, the situation "/(y/c) has multiple 
maximizers" is unstable. The situation "/(yfc) has a unique maximizer" is stable since it corre- 
sponds to a non-empty open set in R™; thus a convex neighborhood of be found in which 
the Xk{t) will stay by convexity of (|2ip and where fiyk) has a unique maximizer. With respect 
to variations of the y^, the (thus unique) maximizer of y'^ Xk is the only stable equilibrium for 
gradient ascent algorithm (|22p . such that for Xk — > Xoo the only stable situation is synchronization. 
□ 



7.2 Anti-consensus algorithm 

For anti-consensus, in analogy with the previous section, each yk evolves according to a gradient 
algorithm to maximize its distance to Xk(t). If Xk(t) asymptotically converges to Ce{t), this 
becomes equivalent to the gradient anti-consensus algorithm ()15|) . Imposing Xk{Q) = yk{0) Vfc, the 
following algorithm achieves this purpose when G{t) is balanced Vt: 

Xk = P SjLi <^jk (xj - Xk) +yk , P > (25) 

Vk = 7i3gradfc^^(y^Xfc) = 7sProjT7W,fc(a;fc) , 75 < , k = l...N. (26) 

Note that the variables Xk and yk are fully coupled; in a discrete-time version of this system, this 
essential feature of the algorithm must be retained in the form of implicit update equations in 
order to ensure convergence (see Q for details). 

Proposition 11 Consider a piecewise continuous, uniformly connected and balanced graph G{t) 
and a manifold M satisfying Assumptions^^ and\^ Then, algorithm I125\ ). l2Bfl with initial condi- 
tions Xfc(O) = yfe(O) Vfc converges to an equilibrium configuration of the anti-consensus algorithm 
for the equally-weighted complete graph, that is \15\) with a < 0. 

Proof: First show that jjj^k^kit) ~ J^kVkit) — Ce{t). Since Xk{0) = yk(0) Vfc, it is true for 
t — 0. Thus it remains to show that Xk{t) = yk{t)- This is the case because a balanced 
graph ensures that the first two terms on the right side of the following expression cancel each 
other: 

EfcifeW = PJ2j (EfeOifc) ^3 - PJ2k (EjOj-fc) + 'EkVkit) ■ 

Next, prove that Vfc, yk{t) is a uniformly continuous function in ^2(0, -|-oo) such that Barbalat's 
Lemma implies yk 0. First show that W{t) ~ ^^k-'^k{t)^Xk{t) is never increasing along the 
solutions of ([25 ]) . (126]) . Denoting by {x)j, j — l...m, the vectors of length TV containing the j-th 
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component of every x^, k — 1...N and by L^'^ the in-Laplacian of the varying graph associated to 
the ajk, one obtains 

The term containing L^*) is non-positive because the Laplacian of balanced graphs is positive semi- 
definite (see Replacing yk from and noting that 
xl ProjryK_fc(a;fc) = (ProjTAi,fc(a;fe)) Proj,.,^ ^(xfc), one obtains 

W{t) = 7B Efe \\P^oiTM,ki^k)r - /3 ix)jL^'\x), < . (27) 

Thus W{t) < W{0) — Y which implies that each yk{t) is in £2(0, -l-oo) since 

W{t) < W{0) also implies that Xk is uniformly bounded Vfc; from (|26p . yk is uniformly bounded as 
well. Combining these two observations, with the ajk bounded, (j25p shows that Xk has a bounded 
derivative and hence is Lipschitz in t Vfc. Now write 

\\yk{xk{ti),yk{ti)) - yk{xk{t2),yk{t2))\\ < 

\\yk{xk{ti),yk{ti)) - yk{xk{t2),yk{ti))\\ + \\yk{xk{t2),yk{ti)) - yk{xk{t2),yk{t2))\\ ■ 

The first term on the second line is bounded by ri 1^1 — ^2! for some ri since ijk is linear in Xk and 
Xk is Lipschitz in t. The second term on the second line is bounded by r2 \ti — <2| for some ^2 since 
yk is Lipschitz in yk (as the gradient of a smooth function along the smooth manifold A4) and 
'^{yk) = ilk is uniformly bounded. Hence, i/k is Lipschitz in t and therefore uniformly continuous 
in t, such that Barbalat's Lemma can be applied. Therefore ijk 0. Thus from [s^, the w-limit 
sets of (PT|) . (P^ correspond to the chain recurrent sets of the asymptotic system 

ife = /3 Hj «jfe " ^k) 
= 7BProjy^ ;^,(xfe) . 

The second line is just a static condition. The chain recurrent set of the linear consensus algorithm 
in the first line reduces to its equilibrium set Xk — Xoo Vfc. But then, from the beginning of the 
proof, Xk = Ce Vfc such that the static condition becomes Q — Projj^^ fc(C'e) Vfc. This is the 
condition for an equilibrium of anti-consensus algorithm (jlSp with 73 = 2aN. □ 

In simulations, a swarm applying (|25p . (|26[) with 2:^(0) — yk{0) Vfc seems to generically con- 
verge to an anti-consensus configuration of the equally-weighted complete graph, that is a stable 
equilibrium configuration of (|15p with a < 0. 

7.3 Examples 

Applying this strategy to the circle yields the results of [39], the Xk reduce to vectors of R-^; 
algorithms (|22p and (|26p respectively drive the yk towards and away from the central projection 
of Xk onto the unit circle. 

The special orthogonal and Grassmann manifolds The particular balancing algorithms 
will not be detailed as they are directly obtained from their synchronization counterparts. Intro- 
ducing auxiliary n x n-matrices Xk , (|2ip may be transcribed verbatim. Using previously presented 
expressions for Proj-p_yy^ ki-^k), (EH) becomes 

OnSOin): Q-'Qk = '-f {QlXk - XlQk) , k=\...N. (28) 

On Gmss(p, n) : life = 7s (nfeX^n^fe + n^feXfeHfe) , k^l...N. (29) 

Note that for Grass{p,n), the projector representation must be used in pT|) and ((25)) . such that 
using n X n matrices Xk becomes unavoidable. 
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7.4 Remark about the communication of estimator variables 

To implement the algorithms of this section, interconnected agents must communicate the values 
of their estimator variable Xk- It is important to note that the variables Xk may not just be a set of 
abstract scalars for each agent k: since Xk interacts with the geometric yk, it must be a geometric 
quantity too. However, the x^ evolve in R™ while the original system lives on A4; the relative 
position of agents on is a meaningful measurement, but nothing ensures a priori that a similar 
thing can be done in R™ . A solution could be to use a common (thus external) reference frame in 
R™ and transmit the coordinates of the Xk in this frame. That solution would unfortunately imply 
that the swarm loses its full autonomy; however, the external frame is just used for "translation" 
purposes and does not interfer with the dynamics of the system. 

When A4 is (a subgroup of) SO{n), the algorithms can be reformulated such that they work 
completely autonomously if interconnected agents measure their relative positions Q^Qj- Indeed, 
define Zk — Qj^Xk- Then (PT|) . (P^ for instance becomes 

Zk = {QlQkfZk + p {{QlQj)Zj - Zk) (30) 

QlQk = f (Zfc-Zj) ,fc=l...iV. (31) 

In this formulation, each agent k can represent Zk as an array of scalars, whose columns express 
the column-vectors of Xk as coordinates in a local frame attached to k (i.e. in a frame rotated by 
Qk with respect to a hypothetical reference frame). Pre- multiplying Zj by QJQj expresses Xj in 
the local frame of k, and QkQk expresses the velocity of Qk (with respect to a hypothetical fixed 
reference) in the local frame of k as well. Thus ([50]) , (|3ip actually corresponds to (PT|) , written 
in the local frame of k. Each agent k gets from its neighbors j k their relative positions QjQj 
and the n x n arrays of numbers Zj] from this it computes the update Zk to its own array of 
numbers Zk and the move it has to make with respect to its current position, QkQk- The same 
can be done for the anti-consensus algorithm. 



8 Conclusion 

The present paper makes three main contributions. 

First, it defines the induced arithmetic mean of N points on an embedded connected compact 
homogeneous manifold A4; though it differs from the traditional Karcher mean, it has a clear 
geometric meaning with the advantage of being easily computable — see analytical solutions for 
SO{n) and Grass{p,n). 

Secondly, a definition of consensus directly linked to the induced arithmetic mean is pre- 



sented for these manifolds. In particular, the notion of balancing introduced in [4^ for the circle 
is extended to connected compact homogeneous manifolds. Consensus for the equally-weighted 
complete graph is equivalent to synchronization. Likewise, it appears in simulations that anti- 
consensus for the equally- weighted complete graph leads to balancing (if N is large enough), even 
though this could not be proved. 

Thirdly, consensus is formulated as an optimization problem and distributed consensus algo- 
rithms are designed for N agents moving on a connected compact homogeneous manifold. In a first 
step, gradient algorithms are derived for fixed undirected interconnection graphs; (anti-)consensus 
configurations are their only stable equilibria. Similar algorithms are considered when the graph 
is allowed to be directed and/or to vary, but their convergence properties are mostly open. In a 
second step, the algorithms are modified by incorporating an estimator variable for each agent. 
In this setting, convergence to the (anti-)consensus states of the equally-weighted complete graph 
can be established theoretically for time- varying and directed interconnection graphs. The mean- 
ingful way of communicating estimators between agents remains an open issue when A4 is not a 
subgroup of SO(n). 

Running examples SO{n) and Grass(j), n) illustrate the validity of the discussion and provide 
geometric insight. The models and results obtained by applying this framework to the circle are 
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strictly equivalent to existing models and results (most significantly in [i^l , [i^l , [s^ ) ■ This draws 
a link from the present discussion to the vast literature about synchronization and balancing on 
the circle. 



9 Appendix 

Lemma 1 // g{Q) = Q^B - B^Q with Q e SO{n) and B e R"""", then g{Q) = Q iff Q = 
UHJH^, where B — UR is a polar decomposition of B, the columns of H contain (orthonormal- 
ized) eigenvectors of R and 

J _ f -Ii \ I even if det([/) > 

\ /„_; y ' I odd if det(C/) < 

Proof: All matrices Q of the given form obviously satisfy that B is symmetric. The following 
constructive proof shows that this is the only possible form. 

Since U^B — R is symmetric with U S 0{n), the problem is to find all matrices T = U^Q G 
0{n) such that S — T^R is symmetric and det(T) = det(C/). Work in a basis of eigenvectors H* 
diagonalizing R with its eigenvalues placed in decreasing order Ai > A2... > A„ > 0. The following 
shows that T is diagonal in that basis. Then orthogonality of T imposes values 1 or —1 on the 
diagonal, the number I of —1 being compatible with det(r) = det(J7); the final form follows by 
returning to the original basis and reordering the eigenvectors such that those corresponding to 
— 1 are in the first columns. 

The j"* column of S is simply the j"* column of T multiplied by Xj. Therefore: 

1. If Xi = Xj, then H* may be chosen such that the corresponding submatrix T{i : j, i : j) ~ 
intersection of rows i to j and columns i to j of T is diagonal. 

2. If Ap+i = and Xp /= 0, then S symmetric implies T(n — p : n, 1 : p) = 0. As T(n — p : 
n, n ~ p : n) is diagonal from 1., only diagonal elements are non-zero in the last n — p rows 
of T. Rows and columns of T being normalized, T{1 : p, n ~ p : n) — 0. 

3. Consider i- < p and i+ the smallest index such that A^^ < Xi_ . Note that 

= - 1 (orthogonahty) and Y., ^L, = Sl_ (symmetry). (32) 

Start with i^ — 1 and assume Xi_^ > 0. (|32p can only be satisfied if Tjk = T^j = Vj > i+ and 
Vfc G [«.«+); 1. further implies Tjk — Tkj — Vj ^ k and Vfc e [z_,i+). This argument is repeated 
by defining the new i^ as being the previous «+ until A^^ — (case 2.) or A,;_ — A„ > 0. This 
leaves T diagonal. □ 
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